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5^ , Abstract. In studying the enumerative theory of super characters' of the group of upper 

Mh' triangular matrices over a finite field we found that the moments (mean, variance and 

■^ , higher moments) of novel statistics on set partitions have simple closed expressions as linear 

combinations of shifted bell numbers. It is shown here that families of other statistics have 

similar moments. The coefficients in the linear combinations are polynomials in n. This 

allows exact enumeration of the moments for small n to determine exact formulae for all n. 
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1. Introduction 



^ ! The set partitions of [n] = {1,2, ■ ■ ■ ,n} (denoted n(n)) are a classical object of combi- 

natorics. In studying the character theory of upper-triangular matrices (see Section [3] for 
background) we were led to some unusual statistics on set partitions. For a set partition A 
of n, consider the dimension exponent 

O; d{X)■.= J2iM^-mi + l)-n 

^^ : -1 

,;;+ I where A has i blocks, M, and m, are the largest and smallest elements of the ith block. How 

O ' does d{X) vary with A? As shown below, its mean and second moment are determined in 

^^ terms of the Bell numbers Br, 



Y^ d{X) = - 2Bn+2 + (n + A)Bn+i 
Aen(n) 

c^_: Y^ d\\) =4Bn+4 - {An + 15)5„+3 + {n^ + 8n + 4)5„+2 - (4n + 3)fi„+i + n5„. 

Aen(n) 

The right hand sides of these formulae are linear combinations of Bell numbers with polyno- 
mial coefficients. Dividing by B„ and using asymptotitcs for Bell numbers (see Section 15. 3p 
in terms of «„, the positive real solution of we" = n + 1 (so «„ = log(n) — loglog(?2) + ■ ■ ■ ) 
gives 

Eid{X)) = (^]n' + 0(^ 



az / \a 



'■n 



This paper gives a large family of statistics that admit similar formulae for all moments. 
These include classical statistics such as the number of blocks and number of blocks of size 
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i. It also includes many novel statistics such as d{X) and Ck{X), the number of fc-crossings. 
The number of 2-crossings appears as the intertwining exponent of super characters. 

Careful definitions and statements of our main results are in Section [2l Section [3] reviews 
the enumerative and probabilistic theory of set partitions, finite groups and super-characters. 
Section Hlgives computational results; determining the coefficients in shifted Bell expressions 
involves summing over all set partitions for small n. For some statistics, a fast new algorithm 
speeds things up. Proofs of the main theorems are in Sections |5] and |6l Section [7] gives a 
collection of examples- moments of order up to six for d{X) and further numerical data. In 
a companion paper [14j, the asymptotic limiting normality of d{X), C2(A), and some other 
statistics is shown. 

2. Statement of the main results 

Let n(ra) be the set partitions of [n] = {1, 2, ■ ■ ■ , n} (so |n(r2)| = i?„, the nth Bell number). 
A variety of codings are described in Section [3l In this section A G Il{n) is described as 
A = B1IB2I ■ ■ ■ |B^ with Bj n Bj = 0, U^=iBj = [n]. Write i ~a j if i and j are in the 
same block of A. It is notationally convenient to think of each block as being ordered. Let 
First(A) be the set of elements of [n] which appear first in their block and Last(A) be the set 
of elements of [n] which occur last in their block. Finally, let Arc(A) be the set of distinct 
pairs of integers {i,j) which occur in the same block of A such that j is the smallest element 
of the block greater than i. As usual, A may be pictured as a graph with vertex set [n] and 
edge set Arc(A). 

For example, the partition A = 1356|27|4, represented in Figure[Tl has First(A) = {1,2, 4}, 
Last(A) = {6, 7, 4}, and Arc(A) = {(1, 3), (3, 5), (5, 6), (2, 7)}. 




12 3 4 5 6 7 

Figure 1. An example partition A = 1356|27|4 

A statistic on A is defined by counting the number of occurrences of patterns. This requires 
some notation. 

Definition 2.1. 

(z) A pattern P_ of length k is defined by a set partition P of [k] and subsets F(P), L(P) C 

[k] and A(P) C [k] x [k]. Let P = (P, F, L, A). 
(a) An occurrence of a pattern P of length fc in A G Il{n) is s = (xi, • • ■ , x^) with Xi G [n] 
such that 

(1) Xi < X2 < ■ ■ ■ < Xk- 

(2) Xi ~A Xj if and only if i ^p j. 

(3) X, G First(A) Hi G F(P). 

(4) Xi G Last (A) if i G L(P). 

(5) (x„x,)G Arc(A)if (^,j)eA(P). 
Write s ^p A if s is an occurrence of P in A. 
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[Hi) A simple statistic is defined by a pattern P of lengtli k and Q G '^[yi, ■ ■ ■ ,yk,fn\. If 
A G n(n) and s = (xi, ■ ■ ■ , x^) Gp A, write Q{s) = Q \y^=^^^^=^. Let 



/(A) = /p,q(A) := 5^ Q{s) 



Let the degree of a simple statistic /pq be the sum of the length of P_ and the degree 
ofQ. 
[iv) A statistic is a finite Q-linear combination of simple statistics. The degree of a 
statistic is defined to be the minimum over such representations of the maximum 
degree of any appearing simple statistic. 



Examples. 

(1) Number of Blocks in A : 



^(A) = E 



l<.x<.n 
X is smallest element in its block 



Here P is a pattern of length 1, F(P) = {!}, L(P) = A(P) = and Q{y,m) = 1. 
Similarly, the nth moment of i{X) can be computed using 

where P^i. is the pattern of length k corresponding to P, the partitions of [k] into 
blocks of size 1, with F{P^) = {1, 2, ■ ■ ■ , k}, and L{P^) = A{P^) = 0. 

(2) Number of blocks of size i : Define a pattern Pj of length i by: (1) all elements of [i] 
are equivalent, (2) F(P,) = {1}, (3) L(PJ = {z}, (4) A(PJ = {(1, 2), ■ ■ ■ , (z - 1,^}. 
Then /p ,i(A) is the number of i-blocks in A. (If i = I, A(P]^) = 0.) Similarly, the 
moments of the number of blocks of size i is a statistic. See Theorem 12.21 

(3) ^-crossings: A A;-crossing [13] of a A G Il{n) is a sequence of arcs {it,jt)i<t<k G Arc(A) 
with 

ii < i2 < ■ ■ ■ < ik < ji < J2 < ■ ■ ■ < jk- 

The statistic crfc(A) which counts the number of fc-crossings of A can be represented 
by a pattern P = (P, F, L, A) of length 2k with (1) z ~p fc + i for i = 1, • ■ ■ , fc, (2) 
F(P) = L(P) = 0, (3) A(P) = {(1, A; + 1), (2, A; + 2), ■ ■ ■ , {k, 2k)}. 

Partitions with cr2(A) = are in bijection with Dyck paths and so are counted 
by the Catalan numbers C„ = ^^ ( ^) (see Stanley's second volume on enumera- 
tive combinatorics [58]). Partitions without crossings have proved themselves to be 
very interesting. See p^ and [44j- The statistic cr2(A) appears as the intersection 
exponent in Section [3131 below. 

(4) Dimension Exponent: The dimension exponent described in the introduction is a 
linear combination of the number of blocks (a simple statistic of degree 1), the last 
elements of the blocks (a simple statistic of degree 2), and the first elements of the 
blocks (a simple statistic of degree 2). Precisely, define //irsts(A) := /p,q(A) where 
P is the pattern of length 1, with F(P) = {1}, L(P) = A(P) = and 'Q{y, m) = y. 
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Similarly, let fiasts{^) '■= fp,QW where P is the pattern of length 1, with L(P) = {1}, 
F(P) = A(P) = and Q(jj,m) = y. Then 

d{X) = flastsW - ffirstsW + ^(A) - U. 

(5) The niaxiniuni block size of a partition is not a statistic in this notation. 
The set of all statistics on U^gn(n) — )■ Q is a filtered algebra. 

Theorem 2.2. Let S be the set of all set partition statistics thought of as functions f : 
IJ^n(n) — )■ Q. Then S is closed under the operations of pointwise scaling, addition and 
multiplication. In particular, if fi, f2 G <S and a G Q, then there exist partition statistics 
Qa, g+1 9* so that for all set partitions X, 



afiiX) 


= 9a{X) 


/i(A) + /2(A) 


= 9+{X) 


/i(A)-/2(A) 


= 9*{X). 



Furthermore, deg{ga) < deg(/i), deg(5f+) < max(deg(/i), deg(/2)), and deg{g^) < deg(/i) + 
deg(/2). In particular, S is a filtered Q-algebra under these operations. 

Remark. Properties of this algebra remain to be discovered. 

Definition 2.3. A shifted Bell polynomial is any function P : N — )■ Q given by 

Rii^) = X] Qj{n)Bn+j 

I<j<K 

where I,K ^ Z and each Qj{x) G Q[x]. i.e. it is a finite sum of polynomials multiplied by 
shifted Bell numbers. Call K the upper shift degree of P and / the lower shift degree of P. 

Our first main theorem shows that the aggregate of a statistic is a shifted Bell polynomial. 

Theorem 2.4. For any statistic, f of degree N, there exists a shifted Bell polynomial R 
such that for all n > 1 

M{f-n):= J2 f{X) = R{n). 
Aen(n) 
Moreover, 

(1) the upper shift index of R is at most N and the lower shift index is bounded below by 
—k, where k is the size of the pattern associated f . 

(2) the degree of the polynomial coefficient of Bn+N-j in R is bounded by j for j < N 
and by j — 1 for j > N. 

Remark. Preliminary work shows that it is also possible to generalize our definition of sta- 
tistics to include "adjacency" conditions. The results of Theorems 12.21 and 12.41 hold for these 
generalized statistics. 

As an example, consider the number of levels, rises, and descents, which are natural in the 
restricted growth sequence encoding of a set partition. The number of levels in restricted 
growth sequence A = (ai, 02, ■ ■ ■ , a„) is the number of i such that Oj = Oj+i. Denote this by 
fieveis{X). In other words, it is the number adjacent parts which appear in the same block. 
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The number of ascents and descents are defined similarly and are contained in this class of 
generalized statistics. Shattuck [53J, see also Chapter 4 of |12], showed, for instance, that 

M{f levels', n) = 2_^ flevelsW = " (-Bn+1 — Bn — -Bn-l) • 

Aen{n) 

It is amusing that this implies that B^n = B^n+i = 1 (mod 2) and i?3„+2 = (mod 2) for 
all n > 0. 

3. Set Partitions, Enumerative Group Theory and Super-characters 

This section presents background and a literature review of set partitions, probabilistic 
and enumerative group theory and super-character theory for the upper triangular group 
over a finite field. Some sharpenings of our general theory are given. 

3.1. Set Partitions. Let 11 (n, /c) denote the set partitions of n labelled objects with k 
blocks and n(n) = Ufcn(n, k); so |n(n, k)\ = S{n, k) the Stirling number of the second kind 
and |n(^)| = Bn the nth Bell number. The enumerative theory and applications of these 
basic objects is developed in Graham-Knuth-Patashnick [30], Knuth [39], Mansour |42] and 
Stanley [57]. There are many familiar equivalent codings 

• Equivalence relations on n objects with k blocks 

1|2|3, 12|3 , 13|2 , 1|23 , 123 

• Binary, strictly upper-triangular zero-one matrices with no two ones in the same row 
or column. (Equivalently, rook placements on a triangular Ferris board (Riordan 

m 





Arcs on n points 

... r\ . /T^ . r\ r^\ 

123 123 123 123 123 

5 7 9 7 

Restricted growth sequences ai, 02, ... , a^; cti = 0, aj+i < 1 + max(ai, . . . , aj) for 
l<j<n (Knuth [39j, p. 416) 

012 , 001 , 010 , Oil , 000 







Semi-labelled trees on n + 1 vertices 

Vacillating Tableau: A sequence of partitions A", A^, ■ ■ ■ , A^" with A*^ = A^" = and 
^2i+i jg obtained from A^* by doing nothing or deleting a square and A^* is obtained 
from A^*^^ by doing nothing or adding a square (see [T3]). 
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A A 



1 2 




1 2 3 



[wu 



The enumerative theory of set partitions begins with Bell polynomials. Let B^^kiwi, 
^Aenrnfc) ri'W^j with Xj(A) the number of blocks in A of size i; so set B„ 

niula gives 



,Wr. 



Wn) 



~Wr. 



and B{t) = Yl^=o ^'"■i'Ml)^ ■ A classical version of the exponential for- 



(3.1^ 



Bit) 



Eoc 
n- 



These elegant formulae have been used by physicists and chemists to understand fragmen- 
tation processes (|19| for extensive references). They also underlie the theory of polynomials 
of binomial type [29| [38] , that is, families P„(x) of polynomials satisfying 

Pnix + y) = J2 Pk{x)Pn^k{y)- 

These unify many combinatorial identities, going back to Faa de Bruno's formula for the 
Taylor series of the composition of two power series. 

There is a healthy algebraic theory of set partitions. The partition algebra of [3l] is based 
on a natural product on 11 (n) which first arose in diagonalizing the transfer matrix for the 
Potts model of statistical physics. The set of all set partitions Un-ni?^) has a Hopf algebra 
structure which is a general object of study in [3j. 

Crossings and nestings of set partitions is a emerging topic, see |35[ |3U [T3| and their 
references. Given A G n(n) two arcs (ii, ji) and (^2; J2) are said to cross if ii < 12 < ji < J2 
and nest if ii < 12 < J2 < ji- Let cr(A) and ne{X) be the number of crossings and nestings. 
One striking result: the crossings and nestings are equi-distributed ([3S] Corollary L5), they 
show 

Aen(n) Aen(n) 

As explained in Section [3l3] below. crossings arise in a group theoretic context and are covered 
by our main theorem. Nestings are also a statistic. This crossing and nesting literature 
develops a parallel theory for crossings and nestings of perfect matchings (set partitions 
with all blocks of size 2). Preliminary works suggest that our main theorem carry over to 
matchings with B^ reduced to {2n)\/2'^n\. 

Turn next to the probabilistic side: What does a 'typical' set partition 'look like'? For 
example, under the uniform distribution on 11 (n) 

• What is the expected number of blocks? 

• How many singletons (or blocks of size i) are there? 

• What is the size of the largest block? 

The Bell polynomials can be used to get moments. For example: 
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Proposition 3.1. 

(z) Let £{X) be the number of blocks. Then 

mii;n):= J^ ^(A) = fin+i - 5, 
Aen(n) 

m{f; n) =5„+2 - 35„,+i + B„, 

m(£ ; n) =Bn+3 — 6i?„,+2 + 8-B„+ii?„,+i — i?„ 

(a) Let Xi{X) be the number of singleton blocks, then 

m{Xi]n) =nBn-i 

m{Xl] n) =nBn^i + n{n - 1)5.„_2 

Remark. In accordance with our general theorem, the right hand sides of (i), (m) are shifted 
Bell polynomials. To make contact with results above, there is a direct proof of these classical 
formulae. 

Proof. Specializing the variables in the generating function (13.11) gives a two variable gener- 
ating functions for t. 

n=0 Aen(n) n>0 

Differentiating with respect to y and setting y = 1 shows that m(£; n) is the coefficient of ^ 
in (e^ — l)e'^''~^. Noting that 

n.=0 

yields m(£) = -B„+i — i?n- Repeated differentiation gives the higher moments. 
For Xi, specializing variables gives 

°° n 

n=0 Aen(n) 

Differentiation with respect to y and settings y = I readily yields the claimed results. D 

The moment method may be used to derive limit theorems. An easier, more systematic 
method is due to Fristedt |27| . He interprets the factorization of the generating function 
B{t) in (13. ip as a conditional independence result and uses "dePoissonization" to get results 
for finite n. Let Xi[X) be the number of blocks of size i. Roughly, his results say that 
{^i}7=i ^^6 asymptotically independent and of size (log(ra))Y'i!. More precisely, let a„ satisfy 
Q^nC"" = n + 1 (so a„ = log(n) — loglog(n) + o(l)). Let /3j = a^/z! then 

P{^^<x}=<l>(x)+o(l) 

y Pi 

where $(x) = -h= f^^ e"" ^'^du. Fristdt also has a description of the joint distribution of the 
largest blocks. 
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The number of blocks i{X) is asymptotically normal when standardized by its mean /i„ ~ 
j^^^T^ and variance a^ ~ kT^Oni' These are precisely given by Proposition 13. II above. Refining 
this, Hwang [32J shows 



P{^,.H.(.),o(!^) 



Stam [55j has introduced a clever algorithm for random uniform sampling of set partitions 
in n(n). He uses this to show that if W{i) is the size of the block containing i, 1 < i < k, 
then for k finite and n large W{i) are asymptotically independent and normal with mean 
and variance asymptotic to q;„. In [11] we use Stam's algorithm to prove the asymptotic 
normality of d{X) and c(A). 

Any of the codings above lead to distribution questions. The upper-triangular represen- 
tation leads to the study of the dimension and crossing statistics, the arc representation 
suggests crossings, nestings and even the number of arcs, i.e. n — i{X). Restricted growth 
sequences suggest the number of zeros, the number of leading zeros, largest entry. See Man- 
sour ||42j for this and much more. Semi-labelled trees suggest the number of leaves, length 
of the longest path from root to leaf and various measures of tree shape (eg. max degree). 
Further probabilistic aspects of uniform set partitions can be found in 



3.2. Probabilistic Group Theory. One way to study a finite group G is to ask what 
'typical' elements 'look like'. This program was actively begun by Erdos and Turan [T9| [20| 
[2T| [22| [23| [211 [25] who focused on the symmetric group Sn- Pick a permutation cr of n at 
random and ask the following: 

• How many cycles in a? (about logn) 

• What is the length of the longest cycle? (about 0.61n) 

• How many fixed points in a? (about 1) 

• What is the order of a? (roughly 6^'°^"^ /^) 

In these and many other cases the questions are answered with much more precise limit 
theorems. A variety of other classes of groups have been studied. For finite groups of Lie 
type see |28j for a survey and [15j for wide-ranging applications. For p-groups see |46| . 

One can also ask questions about 'typical' representations. For example, fix a conjugacy 
class C (e.g. transpositions in the symmetric group), what is the distribution of Xp{C) as p 
ranges over irreducible representations [28] \3E\ [5^ . Here, two probability distributions are 
natural, the uniform distribution on p and the Plancherel measure (Pr(p) = d'i/\G\ with dp 
the dimension of p). Indeed, the behavior of the 'shape' of a random partition of n under the 
Plancherel measure for Sn is one of the most celebrated results in modern combinatorics. See 
Stanley's [56j for a survey with references to the work of Kerov-Vershik [37J, Logan-Shepp 
|40] . Baik-Deift- Johansson [lOj and many others. 

The above discussion focuses on finite groups. The questions make sense for compact 
groups. For example, pick a random matrix from Haar measure on the unitary group f/„ 
and ask: What is the distribution of its eigenvalues? This leads to the very active subject 
of random matrix theory. We point to the wonderful monographs of Anderson-Guionnet- 
Zietouni [5] and Forrester [26] which have extensive surveys. 
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3.3. Super-character theory. Let Gn{q) be the group oi n x n matrices which are upper 
triangular with ones on the diagonal. The group Gn{q) is the Sylow p-subgroup of GL„(F^) 
for q = p"". Describing the irreducible characters of Gn{q) is a well-known wild problem. 
However, certain unions of conjugacy classes, called superclasses, and certain characters, 
called supercharacters, have an elegant theory. In fact, the theory is rich enough to provide 
enough understanding of the Fourier analysis on the group to solve certain problems, see 
the work of Arias-Castro, Diaconis, and Stanley [9]. These superclasses and supercharacters 
were developed by Carlos Andre [6l [TJ [8] and Ning Yan |60]. Supercharacter theory is a 
growing subject. See j21 HI [HI [13 [121 [B] and their references. 

For the groups Gn{q) the supercharacters are determined by a set partition of [n] and a 
map from the set partition to the group F* In the analysis of these characters there are 
two important statistics, each of which only depends on the set partition. The dimension 
exponent is denoted d{X) and the intertwining exponent is denoted i{X). 

Indeed if Xx and x^ are two supercharacters then 

dim ixx) = q'^'^ and (xa, X,) = K,<t^^^ ■ 

While d{X) and i{X) were originally defined in terms of the upper triangular representation 
(for example, d{X) is the sum of the horizontal distance from the 'ones' to the super diagonal) 
their definitions can be given in terms of blocks or arcs: 

(3.2) d{X):= Y. (/-e-1) 

e'--/eArc(A) 

and 

(3.3) ^{\)■.= Yl 1 

ei<e2</i</2 
ei-^/iSArc^A) 
e2'--/2eArc(A) 

Remark. Notice that i{X) = cr2(A) is the number of 2-crossings which were introduced in 
the previous sections. 

Our main theorem shows that there are explicit formulas for every moment of these sta- 
tistics. The following represents a sharpening using special properties of the dimension 
exponent. 

Theorem 3.2. For each k G {0, 1, 2, ■ ■ ■ } there exists a closed form expression 

M{d^; n) := ^ d(A)'= = Pk,2k{n)Bn+2k + Pk,2k-i{n)Bn+2k-i + ■■■ + Pkfl{n)Bn 
Aen(n) 

where each Pk,2k-j is a polynomial with rational coefficients. Moreover, the degree of Pk,2k-j 
is 

3 J <k 

k-\i^] j>k- 
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For example, 

Y, d{X) = - 2fi„+2 + {n + 4)5„+i 
Aen{n) 

^ d{\f =45„+4 - (4n + 15)S„+3 + (n^ + 8n + 9)B„,+2 - (4n + 3)5„+i + nS„ 

AGn{n) 

Remark. See Section [7]for the moments with k <Q and see |^ for the moments with k < 19. 
The first moment may be deduced easily from results of Bergeron and Thieni |II]. Note, 
they seem to have an index which differs by one from ours. 

Remark. Theorem 13.21 is stronger than what is obtained directly from Theorem 12.41 For 
example, the lower shift index is 0, while the best that can be obtained from Theorem 12.41 is 
a lower shift index of —k. This theorem is proved by working directly with the generating 
function for a generalized statistic on "marked set partitions". These set partitions are 
introduced in Section HI 

Asymptotics for the Bell numbers yield the following asymptotics for the moments. The 
following result gives some asymptotic information about these moments. 

Theorem 3.3. Let an = log(ra)— loglog(n) + o(l) be the positive real solution ofue^ = n + 1. 
Then 



E {d{X)) = (^^) n' + O {na-') . 



\k 



Let Sk{d] n) := X^Aenfn) i^W ~ M[d] n)/ Bn) he the symmetrized moments of the dimension 
exponent. Then 



Remark. Asymptotics for Sk{d;n) with k = 1,2,3,4,5,6 and with further accuracy are in 
Section [71 

Analogous to these results for the dimension exponent are the following results for the 
intertwining exponent. 

Theorem 3.4. For each k G {0, 1, 2, ■ ■ ■ } there exists a closed form expression 

M{i^;n):= ^ i{X)'' = QkMn)Bn+2k + ■ ■ ■ + Qk,o{n)Bn + ■ ■ ■ + Qk,-k{n)Bn-k 
Aen(n) 
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where each Qk,2k-j is a polynomial with rational coefficients. Moreover, the degree of Qk,2k-j 
is bounded by j . For example, 

M{t; n) =i ((2n + 1)5„ + {2n + 9)fi„+i - 55„+2) 

M{i^; n) =^ ((36^2 + 2An - 23)5„ + (72^2 + 72n - 260)5„+i 

+ {36n^ + 156n + 489)5„+2 - (180n + 814)B„+3 + 2255„+4) . 

Remark. Theorem 13.41 is deduced directly from Theorem 12.41 The shifted Bell polynomials 
for M{i''; n) for A; < 5 are given in Section [71 

Remark. Amusingly, the formula for M{i; n) implies that the sequence {i?„}^Q taken modulo 
4 is periodic of length 12 beginning with {1, 1, 2, 1, 3, 0, 3, 1, 0, 3, 3, 2}. Similarly, the formula 
for M{i'^;n) shows that the sequence is periodic modulo 9 (respectively 16) with period 39 
(respectively 48). For more about such periodicity see the papers of Lunnon, Pleasants, and 
Stephens [5T] and Montgomery, Nahm, and Wagstaff |47) . 

In analogy with Theorem 13.31 there is the following asymptotic result. 

Theorem 3.5. With an as above, 



E{^{\))=[^-^]n' + 0{na-'). 



Let Sk{i; n) = X^AGnn (^('^) ~ ^i(^' n)/Bn) . Then, 

n I- \ 3a;^ - 22a;„ + 56 3 , „. 2 -\\ 



{an - 5) (4^3 _ 31^2 ^ ^QQ^^ ^ gg)^^ 

^aiian^r? 



3„-3\ 



Sz[%\n)= ^— ^^^^ n +0(na, 



Theorems 13.21 and 13.41 show that there will be closed formulas for all of the moments 
of these statistics. Moreover, these theorems give bounds for the number of terms in the 
summand and the degree of each of the polynomials. Therefore, to compute the formulas it 
is enough to compute enough values for M{d^] n) or M{i^:, n) and then to do linear algebra 
to solve for the coefficients of the polynomials. For example, M{d;n) needs Pi^2{n) which 
has degree at most 0, Pi,i{n) which has degree at most 1, and Pifi{n) which has degree at 
most 0. Hence, there are 4 unknowns, and so only M{d;n) for n = 1,2,3,4 are needed to 
derive the formula for the expected value of the dimension exponent. 

4. Computational Results 

Enumerating set partitions and calculating these statistics would take time 0{Bn) (see 
Knuth's volume [39j for discussion of how to generate all set partitions of fixed size, the book 
of Wilf and Nijenhuis [45j, or the website [52] of Ruskey). This section introduces a recursion 
for computing the number of set partitions of n with a given dimension or intertwining 
exponent in time 0{n'^). The recursion follows by introducing a notion of "marked" set 
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partitions. This generalization seems useful in general when computing statistics which 
depend on the internal structure of a set partition. The results may then be used with 
Theorems 13.21 and 13.41 to find exact formulas for the moments. Proofs are given in Section |5l 
For a set partition A mark each block either open or closed. Call such a partition a marked 
set partition. For each marked set partition A of [n] let o(A) be the number of open blocks of 
A and i{X) be the total number of blocks of A. (Marked set partitions may be thought of as 
what is obtained when considering a set partition of a potentially larger set and restricting 
it to [n]. The open blocks are those that will become larger upon adding more elements of 
this larger set, while the closed blocks are those that will not.) With this notation define 
the dimension of A with blocks Bi, B2, • • ■ by 



(4.1) 



d{\) 



( 



\B. 



\ 



y max(Bj) 



is closed 



^ min(B,) + £(A) + n (o(A) - 1) 



/ 



It is clear that if o(A) = 0, then A may be thought of as a usual 'unmarked' set partition 
and d(X) = d{X) is the usual dimension exponent of A. Define 

(4.2) f{n; A,B):={Xe U{n) : o(A) = A and d{X) = B} 

Theorem 4.1. For n > 

f{n-A,B)=f{n-l;A-l,B^A + l) + f{n~l;A,B-A) 

+ Af{n -1;A,B-A + 1) + {A + l)f{n -1;A + 1,B-A). 

with initial condition /(O; A, B) = for all {A, B) 7^ (0, 0) and /(O; 0, 0) = 1. 

Therefore, to find the number of partitions of [n] with dimension exponent equal to k, it 
suffices to compute f{n,0,k) for k and n. Figure [2] gives the histograms of the dimension 
exponent when n = 20 and n = 100. With increasing n, these distributions tend to normal 
with mean and variance given in Theorem 13.31 This approximation is already apparent for 
71 = 20. 
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Figure 2. Histograms of the dimension exponent counts for n = 20 and n = 100. 



SET PARTITION STATISTICS: DIMENSION AND INTERTWINING EXPONENT 13 

It is not necessary to compute the entire distribution of the dimension index to compute 
the moment formulas for the dimension exponent. Namely, it is better to implement the 
following recursion for the moments. 

Corollary 4.2. Define Mk{d;n,A) := ^X(.n{n) d{X)^ . Then 

o(X)=A 

Mk{d; n,A)=J2 T ) (^ - I)'"' Mj{d; n - 1,A - 1) + J2 TA A''-^Mj{d] n-l,A) 

+ ^^ [^Va - l)'=-^M,(d; n-l,A) + {A + l)J2 (^) A'^'^^id; n-l,A + l) 

To compute M{d^\ n), then for each m < n this recursion allows us to keep only k values 
rather than computing all 0{ni ■ rn^) values of f{m,A,B). To find the linear relation of 
Theorem 13.21 only 0{k ■ k"^) values of Mk{d] n, A) are needed. 

In analogy, there is a recursion for the intertwining exponent. Let /(j)(n, A, S) be the 
number of marked partitions of [n] with intertwining weight equal to B and with A open 
sets where the intertwining weight is equal to the number of interlaced pairs i '^ j and k ^~^ i. 
where /c is in a closed set plus the number of triples ^, /c, j such that i ^- j and k is in an 
open set. 

Theorem 4.3. With the notation above, the following recursion holds 

f(,){n + 1, A, B) =f^,){n, A, B) + /(,)(n, A-l,B) 

A A-l 

+ 5^/(,)(n,A+l,5-j) + 5^/w(n,A5-j)- 

j=0 j=0 

This recursion allow the distribution to be computed rapidly. Figure [3] gives the histograms 
of the intertwining exponent when n = 20 and n = 100. Again, for increasing n the 
distribution tends to normal with mean and variance from Theorem 13.51 The skewness is 
apparent for n = 20. 
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Figure 3. Histograms of the dimension exponent counts for n = 20 and n = 100. 
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5. Proofs of Recursions, Asymptotics, and Theorem 13.21 

This section gives the proofs of the recursive formulas discussed in Theorems 14.11 and 14.31 
Additionally, this section gives a proof of Theorem 13.21 using the three variable generating 
function for f{n,A,B). Finally, it gives an asymptotic expansion for Bn+k/Bn with k fixed 
and ra — 7- oo. This asymptotic is used to deduce Theorems 13.31 and 13.51 

5.1. Recursive Formulas. This subsection gives the proof of the recursions for /(n. A, B) 
and f(i){n, A, B) given in Theorems 14. II and 14. 31 The recursion is used in the next subsection 
to study the generating function for the dimension exponent. 



Proof of Theorem \4.1\ The four terms of the recursion come from considering the following 
cases: (1) n is added to a marked partition of [n — 1] as a singleton open set, (2) n is added 
to a marked partition of [n — 1] as a singleton closed set, (3) n is added to an open set of 
a marked partition of [n — 1] and that set remains open, (4) n is added to an open set of a 
marked partition of [n — 1] and that set is closed. D 

Proof of Theorem \4.!^ The argument is similar to that of Theorem 14. II The same four cases 
arise. However, when adding n to an open set the statistic may increase by any value j and 
it does so in exactly one way. D 

5.2. The Generating Function for f{n,A,B). This section studies the generating func- 
tion for f{n,A,B) and deduces Theorem 13.21 Let 

(5.1) FiX,Y,Z):= J2 fin;A,B)^Y^Z^ 

n,A,B>0 

be the three variable generating function. Theorem 14.11 implies that 

(5.2) ^^(^' ^' ^) = (1 + ^) (^(^' ^^' ^) + ^^(^' ^^' ^)) ' 

where Fy denotes -^P- 

Then F{X, 0, Z) is the generating function for the distribution of d{X), i.e. 

n=0 AGn(n) 



Thus, the generating function for the A;th moment is 

k 

\Z=1,Y=Q' 



n>0 

Consider 



J2M(d'':n)-=(z-\ F(X,Y,Z)\ 



(5.3) FkiX,Y) ■.= (Z—] FiX,Y,Z)\^^^. 



d ^' 



dZ 



SoFk{X,0) = J2M{d'';n 



X" 
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Lemma 5.1. In the notation above, 



(^-(i+^-)|^)^»(-'^-^-)-(^+^-' !:(:)( 



d\''/_ d 



'W i + aFl^-*^-^')!' 



Proof. From (15. 2p . 

_d_ 
dX 



Fjx,Y) ^ (1 . r,i: Q ((r|,)' (i . ^) F„_.(x,.V 



Hence solving for Fn gives 
(5.4) 

^ - (1 H- r) A) ^.(x, r) ^ (1 . n E (:) ( (^■^) ' -^ ^ I ^-(■'^. >■) 

n 

Throughout the remainder F = e" — 1. Abusing notation, let 

Gfc(X, a) := GkiX, Y) := Fk{X, Y) exp (-(1 + F)(e^ - 1)) . 

The following lemma gives an expression for Gk{X,a) in terms of a differential operators. 
Define the operators 

dX da 

T :=^ + e^+°. 
Lemma 5.2. Clearly Go{X,Y) = 1. Moreover, 

Proof. (15. 4p is equivalent to 



^ + (l + y)e--(l + F)(— + e-))G„(X,r) 



fc>0 



=-^-'S(:)0-(^ 



Now 



where a e" has been commuted through. Then 

(5.5) RGkiX,a) = J2(^^ {T - TS'^ - S)' TG^-,. 
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Since Gk{0, a) = ioi k > 0, 

(5.6) Gk{X,a)= [ y2(^^{T-TS-'-SyTGk-iit,X + a-t)dt. 

Jo ^>o V/ 

From this 

G,{X,a) = J2cUcS''T''X% 

a,b,c 

for some constants CK^- D 



The next lemma evaluates the terms in the summation of Lemma 15. 2[ thus yielding a 
generating function for Gk{X, Y) which resembles that for the Bell numbers. 

Lemma 5.3. 

(r'l)|„„exp(e--l)=^B„«ir. 

n>0 

Proof. It is easy to see by induction on i that T^l is a polynomial in e"^"^". Thus 

Hence 

From this, it is easy to see that 

And the result follows. D 

Lemmas l5.2l and [5.3l readily yield the following expression for the moments of the dimension 
exponent as a shifted Bell polynomial. 

Lemma 5.4. For each k > and n > 

Mid"; n) = J2 CafiA^ -l)---{n-c+ l)5„+,_,. 

a,b,c 

Theorem 13.21 needs some further constraints on the degrees of terms in this polynomial. 
The following lemma yields the claimed bounds for the degrees. 

Lemma 5.5. In the notation above, C^bc — unless all of the following hold: 

(1) c<b. 

(2) c < b unless a = 0. 

(3) b < 2k. 

(4) 3c~b<k. 

(5) 3c-6< A;-2 z/a^O. 
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Proof. Let i7a,b,c(X,a) = S'^T^X'^l. Using Equation (EB]), write C^^,^^ in terms of the C^^^^^ 
for i < k. To do this requires understanding 

/ Ha,bAt,X + a-t)dt. 
Jo 

As a first claim: if a = 0, then the above is simply ■^Hom,c+i- This is seen easily from the 
fact that R commutes with T. For a 7^ 0, it is easy to see that this is a linear combination 
of the Ha,b,c' over d < c, and of -ffo,fe',o over b' < b. 

The desired properties can now be proved by induction on k. It is clear that they all hold 
for k = 0. For larger k, assume that they hold for all k — i, and use Equation 15.61 to prove 
them for k. 

By the inductive hypothesis, the TG^-e are linear combinations of i?a,6,c with c < b. Thus 
(T — TS~^ — S) TGk-e is a linear combination of Ha,b,c^ with b > c. Thus, by Equation 
(15. 6p . Gk is a linear combination of Ha,b,c^ with c <b and a = or with c < b. This proves 
properties 1 and 2. 

By the inductive hypothesis the Gk-e are linear combinations of Ha^b,c with b < 2{k — i). 
Thus (T — TS^^ — S) TGk-f, is a linear combination of Ha^J^ with b<2k + l— (!,< 2k. 
Thus, by Equation (15. 6p . Gk is a linear combination of Ha,b,c^ with 6 < 2k. This proves 
property 3. 

Finally, consider the contribution to Gk coming from each of the Gk-t terms. For £ = 1, 
Gk-i is a linear combination of Ha,b,c^ with 3c — 6< A; — 3 if 07^ 0, 3c — 6< A; — lif 
a = 0. Thus TGk-i is a linear combination of Ha^b,c^ with 3c — 6 < /c — 3 if a 7^ 0, and 
3c — 6 < k — 2 otherwise. Thus, (T — TS~^ — S) TGk-e is a linear combination of Ha^b,cS 
with 3c — 6< A; — 3ifa = 0, and 3c — b < k — 2 otherwise. Thus the contribution from 
these terms to Gk is a linear combination of Ha^b,cS with 3c — b < k and 3c — b < k — 2 ii 
a 7^ 0. For the terms with i > 1, Gk-e is a linear combination of Ha^b,c^ with 3c — b < k — 2 
and 3c — 6 < A; — 4 when a 7^ 0. Thus, TGk-e is a linear combination of Ha^b,c^ with 
3c — 6 < A; — 3, as is (T — TS'^ — S) TGk-e- Thus, the contribution of these terms to Gk 
is a linear combination of Ha^b,c^ with 3c — b < k and 3c — 6 < A; — 3 if a 7^ 0. This proves 
properties 4 and 5. 

This completes the induction and proves the Lemma. D 



From this Lemma, it is easy to see that 



2k 



M{d'-n) = Y,Bn+ePkA^) 



e=o 



for some polynomials Pk/{n) with deg(Pfc,f) < min(2A; — £, A;/2 + C/2). 

5.3. Asymptotic Analysis. This section presents some asymptotic analysis of the Bell 
numbers and ratios of Bell numbers. These results yield Theorems 13.31 and 13.51 Similar 
analysis can be found in |39] . 



Proposition 5.6. Let an he the solution to 

we" = n + 1 



18 BOBBIE CHERN, PERSI DIACONIS, DANIEL M. KANE, AND ROBERT C. RHOADES 

and let 

._ „„ / 1 \ A; _ (n + l)K + l) + fc 
C,n,k ■- e liH IH - . 

Then 

5„,+fc = ^-^^U exp (e"" -(n + k + l) log(a„)) (l + O (e""")) . 
\/2ne 



More precisely, for T > 



5„+fc = ^^^Cl exp (e"" -(n + k + l) log(a.)) 
v27re 



V m=l ^ 



T+1 



where Rm,k c^^e rational functions. In particular 



((-12A:2 + 24A; - 2) + (-24A;2 + 2Ak + 18)u + (-12A;2 - 12/;; + 20)n2 + {-12k + 3)u^ - 2n^) 

^^''^^^^ = 24(^7TTp 

„ , , (144A;^ - 384A;3 + 624A;2 - 1152A; + 100) + (576A:^ - 576k^ + SlGk"^ - 3264A; - 648)n 
'^ ^ 1152(n + l)6 

(864A:'' + 1056/c3 + 432A;2 - 6384/c - 1292)^2 



+ 



1152(u + l)^ 
(576A;^ + 2784/c3 + 2280/^2 - 7440A: - 2604)^3 



1152(n + l)6 



{UAk'^ + 2016/c3 + 3888A;2 - 3552k - 2988)u^ + (480A;3 + 2328/0^ + 72A; - 1800)n" 



+ 



1152(n + l)< 
(480A;2 + 600/fc - 551)^^ + (144A; - 60)u^ + 4n^ 
1152(u + l)6 



Proof. The proof is very similar to the traditional saddle-point method for approximating 
Bn. The idea is to evaluate at the saddle point for Bn rather than for Bn+k- We follow the 
proof in Chapter 6 of [18] . 
By Cauchy's formula, 

[n + k)\ Jc 

where C encircles the origin once in the positive direction. Deform the path to a vertical line 
u — ioo to n + ioo by taking a large segment of this line and a large semi-circle going around 
the origin. As the radius, say R, is taken to infinity the factor z~^~^~^ = 0{R''"'~'^~^) and 
exp(e^) is bounded in the half-plane. 
Choose u = On and then 

2716 f^ 

————Bn+k = exp (e"" -{n + k + l) log(a„)) / exp {tpnAv)) dy 
[n + K)\ J-oo 

where 

^uAy) = e°" ({e'' - 1) - !i±l±^ log (1 + tya-') 
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The the real part has maxima around y = 27['m for each integer m, but using log (1 + y^a^^) > 
\y'^Ci^'^ for TT < y < an and 1 + y'^a'^ > '^yct^^ for y > an as in jTB] gives 

exp {^pn,k{y)) dy = exp {^n,kiy)) dy + iexpi 

Next, note that 

iky /n + l + A;\n + ly2/l n+l + k \n + l 

where ""^i"*"^ = a„ + ke~°'" and e"" = ^^-^^ were used. Hence, 

y \ ik y"^ -sr^ /I /i\m n + l + /c \n+l/ zy 



^■-h^ h-^^-^ + E ;^ + (-i 



Making the change of variables and extending the sum of interval of integration gives 

OO / / gQ, 

exp (V'n,fc(y)) (iy + O exp 

.^ V V a^ 

ik -sr-^ ( \ n+ 1 + k \ n + \ I iy 



OO 



Hence, Taylor expanding around y = and using 

_j^ [o A; = l (mod 2) 

^'^' ^ ^2/ = <^ ^/2^^M^ k^Q (mod 2) 

gives the desired result. For more details see [18]. D 

Proposition 15.61 yields 

(5.7) ^ = ^^a;^fl- , '"" ^ V^(l + 0(e--)). 

5„ n! " V (r2 + l)(a„ + l); ^ ^ >> 

Direct application of this result gives the results in Theorems 13.31 and 13.51 

6. Proofs of Theorems [2l2] and 12.41 

This section gives the proofs of Theorems 12.41 and 12.21 This result implies Theorem 13.41 
A pair of lemmas which will be useful in the proof of Theorem 12.41 

Lemma 6.1. For Bn the Bell numbers, define 

n—k 



'^ fn- k\ 

4—n \ / 



where r, d, k, s are non-negative integers. Then gr,d,k,s{n) is a shifted Bell polynomial of lower 
shift index —k and upper shift index r + s — k. 



20 BOBBIE CHERN, PERSI DIACONIS, DANIEL M. KANE, AND ROBERT C. RHOADES 

Proof. It clearly suffices to prove that gr,o,k,s{n) is a shifted Bell polynomial. Since grfifi,s{n- 
k) = gr,o,k,s{n), it suffices to prove that gr,s{n) '■= gr,o,o,s{n) is a shifted Bell polynomial. 
For this consider the exponential generating function 

OO n / \ rn OO OO n-Uh 



n=0 n=0 i=0 ^ ^ a=0 6=0 



dx^ \ ^-^ n\ ] dx 

\n=0 / 

This is easily seen to be equal to e^"""^ times a polynomial in e^. On the other hand, the 
exponential generating function for go,s{n) = Bn+s (with S{s.,a) the Stirling number of the 
second kind) is 



dx'^ 

a=0 



ls,ale"'', 



which is e'^"'"^ times a polynomial in e^ of degree exactly s. From this, conclude that 
the space of all polynomials in e^ times e^"""^ is spanned by the set of generating func- 
tions for the sequences Bn+s as s runs over non-negative integers. In particular, e'^^e'^'""^ = 
^^0^0=0/^9."-^"+"^ ^'-'^ some rational numbers Pq^a- Since the generating function for 
gr,s{n) lies in this span. Moreover, 

r+s—k 



grfi,k,s{n) — 2_^ Oir+s,b+kBn+b 



b=-k 

for some rational numbers an,m- This gives the result. D 

For a sequence, r = {tq, ri, ■ ■ ■ , r^}, of rational numbers and a polynomial Q G Q[yi, ■ ■ ■ , |/fe, m] 
define 

k 

(6.1) M{k, g, r, n, x) := J] g(xi, . . . , x^, n) J](x + r,)^»+i-^-\ 

l<xi<3;2<---<2:j;<n i=0 

where xq = 0, Xk+i = n + 1. 

Lemma 6.2. Fixk, letQ G Z[|/i, ■ ■ ■ ,yk,m] andr = {ro,ri, ■ ■ ■ ,rk} be a sequence of rational 
numbers. As defined above, M{k,Q,r,n,x) is a rational linear combination of terms of the 
form 

F(r^)G'(x)(x + r,)"-^ 
where F G Q[n], G G Q[x] are polynomials. 

Proof. The proof is by induction on A;. If A; = then definitionally, M{k, Q, r, n, x) = 
Q{n){x + ro)", providing a base case for our result. Assume that the lemma holds for k one 
smaller. For this, fix the values of xi, . . . ,Xk-i in the sum and consider the resulting sum 
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over Xk- Then 

fc-2 

M{k,Q,r,n,x)= J] JJ^^ + ^^r^'~"'~' 

X Yl ^(^1' ■■■,^k,n){x + n-iT'^-^'-'-^x + rfc)"-^^ 

a;fc_i<Xfc<ra 

Consider the inner sum over x^'- 

If Tk-i = Tk, then the product of the last two terms is always {x + r^)"'"^''-!"^, and thus 
the sum is some polynomial in xi, . . . ,Xk-i,n times (a; + rfc)"~^''="^~^. The remaining sum 
over xi, . . . , Xk-i is exactly of the form M{k — 1, Q' , r', n — l,x), for some polynomial Q' , 
and thus, by the inductive hypothesis, of the correct form. 

If ^k-i 7^ ^k the sum is over pairs of non-negative integers a = x^ — x^-i — 1 and b = 
n — Xk — 1 summing to ra — Xk-i — 2 of some polynomial, Q' in a and n and the other Xi 
times {x + rk-iY^x + r^)^. Letting y = {x + Tk-i) and z = {x + r^), this is a sum of 
Q'{xi,n,a)y°'z'^. Let d be the a-degree of Q'. Multiplying this sum by {y — zY~^^, yields, 
by standard results, a polynomial in y and z of degree n — Xk-i — 2 + (rf + 1) in which 
all terms have either y-exponent or z-exponent at least n — Xk-i — 1. Thus this inner 
sum over Xk when multiplied by the non-zero constant {vk-i — VkY^^ yields the sum of a 
polynomial in x,n,Xi, . . . , Xk-i times {x + r^.i)""^*^-!"^ plus another such polynomial times 
(x + rfc_i)"^^'=-i~^. Thus, M{k,Q,T,n,x) can be written as a linear combination of terms 
of the form G{x)M{k — 1, Q', r', n, x). The inductive hypothesis is now enough to complete 
the proof. D 

Turn next to the proof of Theorem 12.41 



Proof of Theorem \2.4\ It suffices to prove this Theorem for simple statistics. Thus, it suffices 



to prove that for any pattern P and polynomial Q that 

M{fp,Q;n) = J2 fp,QW = E E ^(^) 
Aen„ Aen„ sepX 

is given by a shifted Bell polynomial in n. As a first step, interchange the order of summation 
over s and A above. Hence 

Mifp,Q;n)= E^(^) E 1- 

se[n]'= AGn(n) 

sSpA 

To deal with the sum over A above, first consider only the blocks of A that contain some 
element of s. Equivalently, let A' be obtained from A by replacing all of the blocks of A that 
are disjoint from s by their union. To clarify this notation, let Il'{n) denote the set of all 
set partitions of [n] with at most 1 marked block. For A' G Il'{n) say that s Ep A' if s in an 
occurrence of P in A' as a regular set partition so that additionally the non-marked blocks of 
A' are exactly the blocks of A' that contain some element of s. For A' G 11' (n) and A G n(n), 
say that A is a refinement of A' if the unmarked blocks in A' are all parts in A, or equivalently, 
if A can be obtained from A' by further partitioning the marked block. Denote A being a 
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refinement of A' as A h A'. Thus, in the above computation of M{fp^Q;n), letting A' be the 
marked partition obtained by replacing the blocks in A disjoint from s by their union: 



M{fp,Q;n)= Y^Qis) J2 Y. ^- 



seM*^ A'en'{n) Aen(n) 

sGpA' AhA' 



Note that the A in the final sum above correspond exactly to the set partitions of the marked 
block of A'. For A' G n'(n), let |A'| be the size of the marked block of A'. Thus, 



M{fp,Q;n)= J2 Qi') E ^l^'l- 



se[n]'= A'en'(n) 

sgpA' 



Remark. This is valid even when the marked block is empty. 

Dealing directly with the Bell numbers above will prove challenging, so instead compute 
the generating function 



|A'| 



M{P,Q,n,x):= J^ Q(') E ^ 

seN* A'Gn'(n) 

sGpA' 

After computing this, extract the coefficients of M{P, Q, n, x) and multiply them by the 
appropriate Bell numbers. 

To compute M{P,Q,n,x), begin by computing the value of the inner sum in terms of 
s = {xi < X2 < ■ ■ ■ < Xk). Denote the equivalence classes in P by 1, 2, . . . , £. Let Zi be a 
representative of this i*^ equivalence class. Then an element A' G 11' (n) so that s Ep A' can 
be thought of as a set partition of [n] into labeled equivalence classes 0,1, ... ,i, where the 
0*'' class is the marked block, and the i*'' class is the block containing Xz^. Thus think of the 
set of such A' as the set of maps g : [n] ^ {0, 1 , . . . , i} so that: 

(1) g{xj) = i ii j is in the i*^ equivalence class 

(2) g{x) ^ i ii X < Xj, j E First (P) and j is in the i*^ equivalence class 

(3) g{x) ^ i ii X > Xj, j E Last(P) and j is in the i*^ equivalence class 

(4) g{x) ^ i ii Xj < X < Xji, {j,j') G Arc(P) and j,j' are in the i*^ equivalence class 

It is possible that no such g will exist if one of the latter three properties must be violated by 
some X = Xh- If this is the case, this is a property of the pattern P, and not the occurrence 
s, and thus, M{fp^Q;n) = for all n. Otherwise, in order to specify g, assign the given 
values to g{xi) and each other g{x) may be independently assigned values from the set of 
possibilities that does not violate any of the other properties. It should be noted that is 
always in this set, and that furthermore, this set depends only which of the Xi our given x 
is between. Thus, there are some sets 5*0, 5*1, . . . , 5*^ C {0, !,...,£}, depending only on s, so 
that g is determined by picking functions 

{l,...,Xi-l}^So,{xi + l,...,X2-l}^Si,...,{xk + l,...,n}^Sk- 

Thus the sum over such A' of x''^'' is easily seen to be 

(x + ror-\x + ur-^^-' ■■■{x + rfc_i)^'=-^'=--i(a; + rfc)"-^\ 
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where r^ = |S'j| — 1 (recall \Si\ > 0, because G Si). For such a sequence, r of rational 
numbers define 



(6.2) M(A;,Q,r,n,x):= J^ Q(xi, . . . ,Xfc,n) J](a; + ri)^«+- 

l<xi<X2<...<xi^<n i=0 



Xi-1 

5 



where, as in Lemma [6l2l using the notation xq = 0, Xk+i = n+1. By Lemma [672l M{k, Q, r, n, x) 
is a linear combination of terms of the form F{n)G{x){x + ri)^~^ for polynomials F G Q[n] 
and G G Q[x]. Thus, M{fp^Q]n) can be written as a linear combination of terms of the 
form gr,d,e,s{n) where i is the number of equivalence classes in P and r, d, s are non- negative 
integers. Therefore, by Lemma [6.11 M(fpn: n) is a shifted Bell polynomial. 

To complete the proof of the result it is sufficient to bound the lower shift index of the 
Bell polynomial. This follows from the fact that wi/,n = 0{n^Bn) and by 05. 7p each term 
n°'Bn+l3 is of an asymptotically distinct size. By 06. 2p it is clear the largest power of x in 
each term is {n — k). Thus, from Lemma 16.1^ the resulting shift Bell polynomials can be 
written with minimum lower shift index —k. This completes the proof. D 

Next turn to the proof of Theorem 12. 2[ To this end, introduce some notation. 

Definition 6.3. Given three patterns Pi, P2, P3, of lengths ki, k2, k^, say that a merge of Pi 
and P2 onto P3 is a pair of strictly increasing functions mi : [ki] — )■ [k^], 77^2 : [^2] -^ [ks] so 
that 

(1) mi([A;i])Um2([A;2]) = [A;3] 

(2) 7711(2) ~P3 mi{j) if and only if i ~pj j, and m2{i) ~P3 7712(7) if ^^id only if i r^p^ j 

(3) i G First(P3) if and only if there exists either a j G First(Pi) so that i = mi{j) or 
a j G First (P2) so that i = 7712(7) 

(4) i G Last(P3) if and only if there exists either a j G Last(Pi) so that i = mi{j) or a 
j G Last(P2) so that i = 7712(7) 

(5) (z, i') G Arc(P3) if and only if there exists either a (j, j') G Arc(Pi) so that i = rrii {j) 
and i' = mi{j') or a {j,j') G Arc(P2) so that i = 777,2(7) &iid i' = 7712(7') 

Such a merge is denoted as 1711,1712 '■ Pi, P2 — ?• Ps- 

Note that the last three properties above imply that given Pi and P2, a merge (including 
a pattern P3) is uniquely defined by maps mi,ni2 and an equivalence relation ~P3 satisfying 
(1) and (2) above. 

Lemma 6.4. Let Pi and P2 be patterns. For any A there is a one-to-one correspondence: 

(6.3) {(si, S2) : si Ep^ A, S2 Ep^ A} ^ {P3, S3 Ep^ A, and 7711,7712: Pi, P2 -^ P3} . 
Moreover, under this correspondence 

Qmi,m2,Qi,Q2{^3) •=Ql(^mi(l)5 ^mi(2)5 • • • 5 Zmj^(^ki)j n)Q2{Zm2{l), Zm2{2), • • • 5 2:^2(^2))''^) 

(6.4) =Qi{si)Q2{s2). 

Proof. Begin by demonstrating the bijection defined by Equation 06. 3p . On the one hand, 
given S3 Ep^ A given hj zi < Z2 < ■ ■ ■ < Zk^ and mi, 1712 : Pi, P2 — ;• P3, define Si and S2 by 
the sequences ^^^(i) < Zmi{2) < ... < Zm^^ki) and ^^2(1) < ^^2(2) < . . . < ^maCfca)- It is easy to 
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verify that these are occurrences of the patterns Pi and P2 and furthermore that equation 
(I6.4p holds for this mapping. 

This mapping has a unique inverse: Given Si and S2, note that S3 must equal the union 
S1US2. Furthermore, the maps iria., for a = 1, 2, must be given by the unique function so that 
fTT'aii) = j if and only if the i^^ smallest element of Sa equals the j*^ smallest element of S3. 
Note that the union of these images must be all of [^3]. In order for S3 to be an occurrence 
of P3 the equivalence relation ~P3 must be that i ^p,^ j if and only if the i^^ and j*'^ elements 
of S3 are equivalent under A. Note that since 5*1 and 5*2 were occurrences of Pi and P2, that 
this must satisfy condition (2) for a merge. The rest of the data associated to P3 (namely 
First (P3), Last (P3) and Arc(P3)) is now uniquely determined by mi,m2,Pi,P2 and the 
fact that P3 is a merge of Pi and P2 under these maps. To show that S3 is an occurrence 
of P3 first note that by construction the equivalence relations induced by A and P3 agree. If 
i G First(P3), then there is a j G First(Pa) with i = ma{j) for some a,j. Since Sa is an 
occurrence of Pa, this means that the j*^ smallest element of Sq in in First(A). On the other 
hand, by the construction of m,a, this element is exactly Zm^i^j) = Zi. This if i G First(P3), 
Zi G First (A). The remaining properties necessary to verify that 6*3 is an occurrence of P3 
follow similarly. Thus, having shown that the above map has a unique inverse, the proof of 
the lemma is complete. D 

Recall, the number of singleton blocks is denoted Xx and it is a simple statistic. To 
illustrate this lemma return to the example of X\ discussed prior to the lemma. Let Pi = P2 
be the pattern of length 1 with Arc (Pi) = 0, First (Pi) = Last (Pi) = 1. Then there are 
five possible merges of Pi and P2 into some pattern P3. The first choice of P3 is Pi itself. 
In which case mi(l) = ^2(1) = 1- The latter choices of P3 is the pattern of length 2 with 
First(P3) = Last(P3) = {1, 2}, Arc(P3) = 0. The equivalence relation on P3 could be either 
the trivial one or the one that relates 1 and 2 (though in the latter case the pattern P3 will 
never have any occurrences in any set partition). In either of these cases, there is a merge 
with rrii(l) = 1 and m2(l) = 2 and a second merge with mi(l) = 2 and vn^iX) = 1- As a 
result, 



M{xi-n)= J2 M>^r= E 



AGn(n) 



A6n(n) 



/ 



\ 



E 



1 



XI 

a::iGFirst(A) 
\ xieLast(A) / 



E E Ei^^E E iH-E El 



Aen(n) 3;'i yi 

xieFirst(A) j/ieFirst(A) 
xieLast(A) yieLast(A) 



Aen(n) 3::i<i'2 

a:i,a::2SFirst(A) 
a;i,X2£Last(A) 



Aen(n) -^'i ,^^ 

xiGFirst(A) 

a;iSLast(A) 



Proof of Theorem \2.^A The fact that statistics are closed under pointwise addition and scal- 
ing follows immediately from the definition. Similarly, the desired degree bounds for these 
operations also follow easily. Thus only closure and degree bounds for multiplication must 
be proved. Since every statistic may be written as a linear combination of simple statistics 
of no greater degree, and since statistics are closed under linear combination, it suffices to 
prove this theorem for a product of two simple statistics. Thus let ft be the simple statistic 
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E 



E 



defined by a pattern Pi of size ki and a polynomial Qi. It must be shown that /i (A) /2(A) is 
given by a statistic of degree at most ki + k2 + deg{Qi) + deg(Q2)- 
For any A 

/l(A)/2(A) = Yl Qli^l)Q2{s2). 

Simplify this equation using Lemma 16. 4[ writing this as a sum over occurrences of only a 
single pattern in A. 
Applying Lemma [6. 4[ 

/i(A)/2(A) = Yl ^1(^1)^2(^2) 

siep-^X,S2£p2X 
Pz mi,m2:Pi,P2'^P3 s^ep-^X 

~ / J jP3,Qmj^,m2,Qi,Q2^ )' 

mi,m2:Pi,P2^Pi 

Thus, the product of /i and /2 is a sum of simple characters. Note that the quantity is a 
polynomial of S3 which is denoted Qmi,m2,Qi,Q2{^3)- Finally, each pattern P3 has size at most 
ki + ^2 and each polynomial Qmi,m.2,Qi,Q2 has degree at most deg{Qi) + deg{Q2)- Thus the 
degree of the product is at most the sum of the degrees. D 

7. More Data 

This section contains some data for the dimension and intertwining exponent statistics. 
The moment formulas of Theorem 13.21 for k < 22 and the moment formulas for the inter- 
twining exponent for k < 12 have been computed and are available at [50]. Moreover, the 
values /(n, 0, B) for n < 238 and f{i){n, 0, B) for n < 146 are available. These sequences can 
also be found on Sloane's Online Encyclopedia of integer sequences [54]. 

The remainder of this section contains a small amount of data and observations regarding 
the distributions f{n,0,B) and /(j)(n, 0,i?) and regarding the shifted Bell polynomials of 
Theorems 13.21 and 13.41 

7.1. Dimension Index. 
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Figure 4. A table of the dimension exponent f{n, 0, d). 
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A couple of easy observations: It is clear that 



■)«— 1 



/(n,0,0) 

That is the number of set partitions of [n] with dimension exponent is 2"~^. Set partitions 
of [n] that have dimension exponent must have n appearing in a singleton set or it must 
appear in a set with n — 1, thus the result is obtained by recursion. Additionally, the number 
of set partitions of [n] with dimension exponent equal to 1 is 722"""^, that is 

/(n,0,l) = n2"-i. 

Curiously, the numbers /(n, 0, B) are smooth (roughly they have many small prime factors), 
for reasonably sized B. This can be established by using the recursion of Theorem 14.11 For 
example, 

/(lOO, 0, 979) =2^^ ■ 3^ • 5^ • 7^ • 11 • 797 • 12269 ■ 12721 

■ 342966248369 ■ 2647544517313 ■ 1641377154765701 

■ 16100683847944858147992523687926541327031916811919 
/(lOO, 0, 2079) =2^^ ■ 3^^ ■ 5^ ■ 7^ ■ 11^ ■ 13^ • 17 ■ 19 ■ 23 ■ 29 ■ 31 • 24679914019 

■ 58640283519733 ■ 194838881932339884007114639638682100019517 

Note that /(lOO, 0,979) has 111 digits and /(lOO, 0, 2079) has about 100 digits. 

In the notation of Theorem 13.21 these are some values of the first few moments of d{X). 



P_{3,0}(ii 
P_{3,l}(ii 
P_{3,2}(ii 
P_{3,3}(ii 
P_{3,4}(ii 
P_{3,5}(ii 
P_{3,6}(ii 

P_{4,0}(n 
P_{4,l}(ii 
P_{4,2}(ii 
P_{4,3}(ii 
P_{4,4}(n 
P_{4,5}(ii 
P_{4,6}(ii 
P_{4,7}(n 
P_{4,8}(ii 

P_{5,0}(n 
P_{5,l}(n 
P_{5,2}(n 
P_{5,3}(n 
P_{5,4}(n 
P_{5,5}(n 



+ln 

-1/3 +6n +3n"2 

+8/3 -45n -12n-2 

+18 +51n +12n-2 +ln-3 

-131/3 -45n -6n"2 

+42 +12I1 



+ln +3n-2 

-21/2 -18ii -20n"2 

-36 +116/3n +72n~2 +6n-3 

-5/6 -166/3n -162n-2 -24n-3 

-103/3 +312n +150n-2 +16n-3 +ln-4 

-81/2 -812/3n -90n-2 -8n~3 

+409/3 +168n +24n-2 

-104 -32n 

+ 16 

+ln +10n^2 

+1036/15 +50/3n -35n~2 +15n"3 

-1373/30 +95/2n -180n"2 -llOn-3 

+4415/3 -1370/3n +2030/3n"2 +300ii"3 +10ii~4 

+47/2 -605/6n -2350/3n~2 -390n~3 -40ii-4 

+ 1049/3 +15n +1380n"2 +330ii"3 +20n-4 +ln-5 
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P_{5,6}(n) = +4673/30 -2485/2n -2750/3n-2 -150n-3 -10n"4 
P_{5,7}(n) = -95/3 +3005/3n +420n~2 +40n^3 
P_{5,8}(n) = -1010/3 -520n -80n~2 
P_{5,9}(n) = +240 +80n 
P_{5,10}(n) = -32 

P_{6,0}(n) = +ln +25n"2 +15n^3 

P_{6,l}(n) = +1655/6 +185/6n -309n^2 -120n^3 

P_{6,2}(n) = -661817/90 -17539/15n +1015n-2 +495n"3 +45n"4 

P_{6,3}(n) = +149203/45 +12779/10n +1935/2n~2 -1770n-3 -340n"4 

P_{6,4}(n) = -1118236/45 +36605/3n -3460n~2 +10420/3n"3 +840n"4 +15n"5 

P_{6,5}(n) = -121658/9 +3887/2n -8385/2n^2 -11450/3n"3 -765n"4 -60n"5 

P_{6,6}(n) = -1547/9 +1133n +3485n^2 +3960n-3 +615n"4 +24n-5 +ln-6 

P_{6,7}(n) = -38697/10 +7573/5n -13695/2n"2 -6940/3n-3 -225n"4 -12n"5 

P_{6,8}(n) = -12653/90 +3410n +3965n^2 +840n^3 +60n-4 

P_{6,9}(n) = +665 -2980n -1560n~2 -160n~3 

P_{6,10}(ii) = +2060/3 +1440n +240n^2 

P_{6,ll}(n) = -528 -192n 

P_{6,12}(n) = +64 

These formulae exhibit a number of properties. Here is a hst of some of them. 

(1) Using the fact that B^+k ~ n'^B^, each moment has a number of terms with asymp- 
totic of size equal to n^^Bn, up to powers of log(n) (or a„). Call these terms the 
leading powers of n. The leading 'power' of n contribution is equal to 



[n - 2TfBr, 



-k 



where T is the operator given by TB^ = -Bm+i- Foi' example the leading order n 
contributions for the average is 

nBn+i — 2Bn+2 
and the leading order contribution for the second moment is 

n^Bn+2 - 4:nBn+3 + ABn+A- 

The next remark also concerns this sort of cancelation. 
(2) The next order n terms have size roughly n'^^~^Bn and have the shape 



where the constants Cj are 



c, = v-\n-3)3. 



(3) The generating function for the polynomials Po,A:(^) seems to be 

$^Po,feH^ = exp ((e^ - l-X)n) . 

fc>0 

We do not have a proof of this observation. 
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As in the introduction, let Sk{d] n) := XIag^ ('^(-^) ^ '^^id; n) j . From Proposition 15.61 

and the formulas for M{d^\ n) deduced from Theorem 13. 21 and stated in Section mi and using 
SAGE the asymptotic expansion of the first few 5*^ are: 



aj - 7an + 17 3 
S2{d]n) = — : — — - — n 



+ 



-8al - 29af, - ISGal - 207at + Q9al + AOYal + 116a„ - 80 



2<(a„ + 1)^ 



n^ + 0{n) 



^ , , , Qal- 83a3 ^ 435^2 _ 732^^ 
S3{d;n) = 



5*4 (d; n) =3 



q;„ - 7q;„ + 17 



3(«„ + l)3a^ 
2 



^1 4 , ^ / ^" 



at 



7f + 



n 



at 



S^{d]n) = — 



10 fal - Ian + 17\ /6< - 83«3 + 435^2 _ 732«„ - 881 



3 V (C^n + l)"n 



(a„ + l)3a4 



"'^"'ii 



S'6((i; n) =15 



aS - 7ar,, + 17 



t„ I ij;„ 



("n + l)a^ 



n^ + O 



n 






Remark. These asymptotics support the claim that the dimension exponent is normally 

. This result will 



distributed with mean asymptotic to j^p?^ and standard deviation y ^^ . „ 
be established in forthcoming work [14j. 

7.2. Intertwining Index. Table |5] contains the distribution for of the intertwining exponent 
for the first few n. 
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Figure 5. A table of the distribution of the intertwining exponent /(j)(n, 0, B). 



In the notation of Theorem 13.41 these are some values of the first few moments of i{X). 
Q_{3,0}(ii) = +19/192 -29/96n +l/16n-2 +l/8n-3 
Q_{3,l}(n) = +331/192 -193/96n -17/16n"2 +3/8n"3 
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Q_{3,2}(n 
Q_{3,3}(n 
Q_{3,4}(n 
Q_{3,5}(n 
Q_{3,6}(n 



Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 
Q_{4, 



OXn 
l}(n 
2}(n 
3}(ii 
4}(ii 
5}(ii 
6}(ii 
7}(n 
8}(n 



Q_{5,0}(ii; 
Q_{5,l}(ii; 
Q_{5,2}(ii; 
Q_{5,3}(ii; 
Q_{5,4}(ii; 
Q_{5,5}(ii; 
Q_{5,6}(ii; 
Q_{5,7}(ii; 
Q_{5,8}(ii; 
Q_{5,9}(ii; 
Q_{5,10}(n) = 



-25/6 -743/96x1 -l/4n~2 +3/8n~3 
+775/64 +449/96n -l/16n"2 +l/8n"3 
-451/32 -619/96n -15/16n~2 
+2045/192 +75/32n 
-125/64 

+4387/172800 +103/360n -ll/32n-2 0n"3 +l/16n^4 
-3343/10800 +787/144n -7/16n~2 -7/4n-3 +l/4n-4 
-25453/3456 +7777/288n -335/48n"2 -23/8n-3 +3/8n^4 
-16681/8640 -4303/288x1 -49/8x1-2 -9/8n-3 +1/4x1-4 
+963509/34560 +23891/480x1 +6xi-2 -5/8n-3 +1/16x1-4 
-637751/14400 -8197/288x1 -53/12x1-2 -5/8ii-3 
+ 126773/3456 +1745/96x1 +75/32x1-2 
-3425/192 -125/32x1 
+625/256 

-107993/138240 -593/69120x1 +569/1152x1-2 -175/576x1-3 -5/192x1-4 +1/32x1-5 
-79109/27648 -67769/7680x1 +9859/1152x1-2 +995/576x1-3 -115/64x1-4 +5/32x1-5 
+5436923/138240 -1228273/11520x1 +5925/128x1-2 +815/96x1-3 -925/192x1-4 +5/16x1-5 
-29849/512 -92287/6912x1 +1375/16x1-2 -65/32x1-3 -415/96x1-4 +5/16x1-5 
+ 1825783/27648 -2270759/13824x1 -6497/576x1-2 +865/288x1-3 -105/64x1-4 +5/32x1-5 
-1092827/138240 +9971653/69120x1 +21119/288x1-2 +725/96x1-3 -145/192x1-4 +1/32x1-5 
-14859283/138240 -6747031/34560x1 -44905/1152x1-2 -1145/576x1-3 -25/64x1-4 
+ 188749/1536 +656965/6912x1 +7225/384x1-2 +125/64x1-3 
-2168275/27648 -61625/1536x1 -625/128x1-2 
+258125/9216 +3125/512x1 
3125/1024 



We conjecture that Qj 



n] 



The calculations for the Si 
above with ( 15. 7p give 

Zal - 22a„ + 56 



for all J < 0. 

{i]n) := ;§;;; 2_.Aen(n) 



i(A) — M{i]n)) . The formulas stated 



5*2 (i;n) 



9a3(a„ + 1) 



-n 



+ 



IQaL - 52a° - 2Mol - Ihbat 



l2Qat 



12al + 230a„ + 175 



S' 



3{^n) 



{an - 5)(4a^ - 31a^ + lOOa^ 



(a„ + 1)4 
+ 99) 



-n 



+ 0{n) 



5*4 (i; n) =3 



3a^ 



8a^(a„ + 1 
22a„ + 56)2 



9al{an + l) 



-n 



+ 



n" 



a-i 



n 



O 



n 



ai 



S^{v,n) =5 



{an - 5)(3a2 - 22a„ + 56) (4a^ - 310^ + 100a„ + 99) 



SQ{i;n) =15 



3a^-22a„ + 56)2^^ 
9al{an + 1) 



36al{an + 1)^ 
n^ + 0{n^) 



n 



+ O (n^) 
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